Name | Version | Summary | Date |
intel-optimization-for-horovod | 0.28.1.6 | Intel® Optimization for Horovod* is the distributed training framework for TensorFlow* and PyTorch*. | 2024-12-19 00:54:16 |
graphistry | 0.35.2 | A visual graph analytics library for extracting, transforming, displaying, and sharing big graphs with end-to-end GPU acceleration | 2024-12-13 21:05:09 |
cars-forge | 1.1.1 | Create an on-demand/spot fleet of single or cluster EC2 instances. | 2024-12-13 16:41:28 |
hopsworks | 4.1.4 | Hopsworks Python SDK to interact with Hopsworks Platform, Feature Store, Model Registry and Model Serving | 2024-12-10 08:48:13 |
yaetos | 0.12.4 | Write data & AI pipelines in (SQL, Spark, Pandas) and deploy them to the cloud, simplified | 2024-12-01 22:03:05 |
h3spark | 0.0.2 | Lightweight pyspark wrapper for h3-py | 2024-11-27 17:01:11 |
cspark | 0.1.10 | A Python SDK for interacting with Coherent Spark APIs | 2024-11-14 20:51:09 |
glue-utils | 0.9.1 | Reusable utilities for working with Glue PySpark jobs | 2024-11-14 10:57:20 |
pysail | 0.1.7 | Sail Python library | 2024-11-02 01:35:01 |
lakehouse-engine | 1.23.0 | A configuration-driven Spark framework serving as the engine for several lakehouse algorithms and data flows. | 2024-10-28 14:58:58 |
hsfs | 3.7.9 | HSFS: An environment independent client to interact with the Hopsworks Featurestore | 2024-10-23 10:47:07 |
pyspark-prometheus | 0.1.3 | Prometheus instrumentation for Spark Streaming metrics. | 2024-10-20 21:55:35 |
nanocube | 0.2.1 | Lightning fast OLAP-style point queries on Pandas DataFrames. | 2024-10-14 21:50:18 |
ocean-spark-airflow-provider | 1.1.4 | Apache Airflow connector for Ocean for Apache Spark | 2024-09-30 16:04:35 |
sparkly-em | 0.1.0 | Sparkly is a TF/IDF top-k blocking for entity matching system built on top of Apache Spark and PyLucene. | 2024-09-26 21:27:44 |
datespan | 0.2.9 | Effortless date span parsing and management. | 2024-09-25 07:28:58 |
datespanlib | 0.1.8 | A library for handling date spans. | 2024-09-21 09:01:10 |
Spark-df-Cleaner | 0.0.5 | spark dataframe cleaner | 2024-09-03 14:40:41 |
Spooq | 3.4.2 | Spooq is a PySpark based helper library for ETL data ingestion pipeline in Data Lakes. | 2024-08-08 13:28:24 |
chispa | 0.10.1 | Pyspark test helper library | 2024-07-31 21:06:41 |